Knowledge Acquisition from Distributed, Autonomous, Semantically Heterogeneous Data and Knowledge Sources (KADASH)

نویسندگان

  • Doina Caragea
  • Vasant Honavar
  • Raghu Ramakrishnan
  • Seung-won Hwang
  • Mahmood Hossain
  • Susan Bridges
  • Yong Wang
  • Zonghuan Wu
  • Vijay Raghavan
  • Weiyi Meng
  • Hongkun Zhao
  • Samir Tartir
  • I. Budak Arpinar
  • Michael Moore
  • Amit P. Sheth
  • Boanerges Aleman-Meza
  • Takahiro Kosaka
  • Susumu Date
  • Hideo Matsuda
  • Shinji Shimojo
  • Liviu Badea
  • Sally McClean
  • Bamshad Mobasher
  • Pawan Lingras
  • Jie Bao
  • Facundo Bromberg
  • Cornelia Caragea
  • Dae-Ki Kang
  • Neeraj Koul
  • Jyotishman Pathak
  • Oksana Yakhnenko
  • Flavian Vasile
چکیده

ion. For example, the program of study a student in a data source can be specified as Graduate Program (higher level of abstraction), while the program of study of a different student in the same data source (or even a different data source) can be specified as Doctoral Program (lower level of abstraction). 2005 IEEE ICDM Workshop on KADASH 5 The workshop brings together researchers in relevant areas of artificial intelligence (machine learning, data mining, knowledge representation, ontologies), information systems (information integration, databases, semantic Web), distributed computing (service-oriented computing) and selected application areas (e.g., bioinformatics, security informatics, environmental informatics) to address several questions such as: • What are some of the research challenges presented by emerging data-rich application domains such as bioinformatics, health informatics, security informatics, social informatics, environmental informatics? • How can we perform knowledge discovery from distributed data (assuming different types of data fragmentation, e.g., horizontal or vertical data fragmentation; different hypothesis classes, e.g., naïve Bayes, decision tree, support vector machine classifiers; different performance criteria, e.g., accuracy versus complexity versus reliability of the model generated, etc.)? • How can we make semantically heterogeneous data sources self-describing (e.g., by explicitly associating ontologies with data sources and mappings between them) in order to help collaborative scientific discovery from autonomous information sources? • How can we represent, manipulate, and reason with ontologies and mappings between

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Algorithms and Software for Collaborative Discovery from Autonomous, Semantically Heterogeneous, Distributed Information Sources

Development of high throughput data acquisition technologies, together with advances in computing, and communications have resulted in an explosive growth in the number, size, and diversity of potentially useful information sources. This has resulted in unprecedented opportunities in data-driven knowledge acquisition and decisionmaking in a number of emerging increasingly data-rich application ...

متن کامل

Information Integration and Knowledge Acquisition from Semantically Heterogeneous Biological Data Sources

We present INDUS (Intelligent Data Understanding System), a federated, query-centric system for knowledge acquisition from autonomous, distributed, semantically heterogeneous data sources that can be viewed (conceptually) as tables. INDUS employs ontologies and inter-ontology mappings, to enable a user or an application to view a collection of such data sources (regardless of location, internal...

متن کامل

Learning classifiers from distributed, semantically heterogeneous, autonomous data sources

Recent advances in computing, communications, and digital storage technologies, together with development of high throughput data acquisition technologies have made it possible to gather and store large volumes of data in digital form. These developments have resulted in unprecedented opportunities for large-scale data-driven knowledge acquisition with the potential for fundamental gains in sci...

متن کامل

Towards Semantics-Enabled Distributed Infrastructure for Knowledge Acquisition

We summarize progress on algorithms and software knowledge acquisition from large, distributed, autonomous, and semantically disparate information sources. Some key results include: scalable algorithms for constructing predictive models from data based on a novel decomposition of learning algorithms that interleaves queries for sufficient statistics from data with computations using the statist...

متن کامل

Decision Tree Induction from Distributed Heterogeneous Autonomous Data Sources

With the growing use of distributed information networks, there is an increasing need for algorithmic and system solutions for data-driven knowledge acquisition using distributed, heterogeneous and autonomous data repositories. In many applications, practical constraints require such systems to provide support for data analysis where the data and the computational resources are available. This ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005